Open Problems 1 When Is Small Beautiful ?

نویسندگان

  • Amiran Ambroladze
  • John Shawe-Taylor
چکیده

Is the textbook result that Solomonoff’s universal posterior convergesto the true posterior for all Martin-Löf random sequences true?Universal Induction Induction problems can be phrased as sequence predic-tion tasks. This is, for instance, obvious for time series prediction, but alsoincludes classification tasks. Having observed data xt at times t < n, the task isto predict the t-th symbol xt from sequence x = x1...xt−1. The key concept toattack general induction problems is Occam’s razor and to a less extent Epicu-rus’ principle of multiple explanations. The former/latter may be interpreted asto keep the simplest/all theories consistent with the observations x1...xt−1 andto use these theories to predict xt. Solomonoff [4, 5] formalized and combinedboth principles in his universal prior M(x) which assigns high/low probability tosimple/complex environments, hence implementing Occam and Epicurus. M(x)is defined as the probability that a universal Turing machine U outputs a stringstarting with x, when provided with fair coin flips on the input.Posterior Convergence Solomonoff’s [5] central result is that if the probabil-ityμ(xt|x1...xt−1) of observing xt at time t, given past observations x1...xt−1is a computable function, then the universal posterior Mt :=M(xt|x1...xt−1)converges (rapidly!) with μ-probability 1 (w.p.1) for t → ∞ to the true poste-rior μt :=μ(xt|x1...xt−1), hence M represents a universal predictor in case ofunknown μ. Convergence of Mt to μt w.p.1 tells us that Mt is close to μt for suf-ficiently large t for “almost all” sequences x1:∞ (we abbreviate x1:n := x1...xn).It says nothing about whether convergence is true for any particular sequence(of measure 0).Martin-Löf Randomness is the standard notion to capture convergence forindividual sequences and is closely related to Solomonoff’s universal prior. Levingave a characterization equivalent to Martin-Löf’s (M.L.) original definition [2]:A sequence x1:∞ is μ-random (in the sense of M.L.) iff there is a constant csuch thatM(x1:n) ≤ c · μ(x1:n) for all n.One can show that a μ-random sequence x1:∞ passes all thinkable effectiverandomness tests, e.g. the law of large numbers, the law of the iterated logarithm,etc. In particular, the set of all μ-random sequences has μ-measure 1.Open Problem An interesting open question is whether Mt converges to μt(in difference or ratio) individually for all Martin-Löf random sequences. Clearly,Solomonoff’s result shows that convergence may at most fail for a set of sequenceswith μ-measure zero. A convergence result for μ-random sequences is particularlyinteresting and natural in this context, since μ-randomness can be defined interms of M itself (see above). 11 IDSIA, Galleria 2, CH-6928 Manno-Lugano, Switzerland [email protected] http://www.idsia.ch/~marcus12 A prize of 128 Euro for a solution of this problem is offered. Proof Attempts Attempts to convert the convergence results w.p.1 to effectiveμ-randomness tests fail, since Mt is not lower semi-computable. In [3, Th.5.2.2]and [6, Th.10] the following Theorem is stated:“Let μ be a positive recursive measure. If the length of y is fixed and thelength of x grows to infinity, then M(y|x)/μ(y|x) → 1 with μ-probability one.The infinite sequences ω with prefixes x satisfying the displayed asymptotics areprecisely [‘⇒’ and ‘⇐’] the μ-random sequences.”While convergence w.p.1 is correct if appropriately interpreted, the proofthat convergence holds for μ-random sequences is incomplete:“M(x1:n) ≤ c·μ(x1:n)∀n ⇒ limn→∞ M(x1:n)/μ(x1:n) exists” has been used, butnot proven, and may indeed be wrong. Vovk [7] shows that for two finitely com-putable semi-measures μ and ρ, and x1:∞ being μand ρ-random that ρt/μt → 1.If M were recursive, then this would imply Mt/μt → 1 for every μ-random se-quencex1:∞, since every sequence is M -random. Since M is not recursive Vovk’stheorem cannot be applied and it is not obvious how to generalize it. So thequestion of individual convergence remains open.Conclusions Contrary to what was believed before, the question of posteriorconvergence Mt/μt → 1 (also Mt → μt) for all μ-random sequences is still open.In [1] we introduce a new flexible notion of randomness which contains Martin-Löf randomness as a special case. This notion is used to show that standardproof attempts of Mt/μtM.L.−→ 1 based on so called dominance only must fail,indicating that this problem may be a hard one.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Tabu Search Method for a New Bi-Objective Open Shop Scheduling Problem by a Fuzzy Multi-Objective Decision Making Approach (RESEARCH NOTE)

This paper proposes a novel, bi-objective mixed-integer mathematical programming for an open shop scheduling problem (OSSP) that minimizes the mean tardiness and the mean completion time. To obtain the efficient (Pareto-optimal) solutions, a fuzzy multi-objective decision making (fuzzy MODM) approach is applied. By the use of this approach, the related auxiliary single objective formulation can...

متن کامل

مدل‌سازی شبکه‌های چندترمیناله در سیستم‌های قدرت با استفاده از روش برازش برداری به‌منظور تحلیل حالات گذرای الکترومغناطیسی فرکانس بالا

Modeling of frequency-dependent components of power system is often based on a terminal description by an admittance matrix in the frequency domain. One challenge in the extraction of state-space models from such data is to prevent possible error magnification when the model is to be applied in time-domain simulations. The error magnification is a consequence of inaccurate representation of sma...

متن کامل

Distribution of Residues Modulo p

The distribution of quadratic residues and non-residues modulo p has been of intrigue to the number theorists of the last several decades. Although Gauss’ celebrated Quadratic Reciprocity Law gives a beautiful criterion to decide whether a given number is a quadratic residue modulo p or not, it is still an open problem to find a small upper bound on the least quadratic non-residue mod p as a fu...

متن کامل

An electromagnetism-like metaheuristic for open-shop problems with no buffer

This paper considers open-shop scheduling with no intermediate buffer to minimize total tardiness. This problem occurs in many production settings, in the plastic molding, chemical, and food processing industries. The paper mathematically formulates the problem by a mixed integer linear program. The problem can be optimally solved by the model. The paper also develops a novel metaheuristic base...

متن کامل

Arhangel’skĭı’s Solution to Alexandroff’s Problem: a Survey

In 1969, Arhangel’skĭı proved that for every Hausdorff space X, |X| ≤ 2χ(X)L(X). This beautiful inequality solved a nearly fifty-year old question raised by Alexandroff and Urysohn. In this paper we survey a wide range of generalizations and variations of Arhangel’skĭı’s inequality. We also discuss open problems and an important legacy of the theorem: the emergence of the closure method as a fu...

متن کامل

Minimizing the total tardiness and makespan in an open shop scheduling problem with sequence-dependent setup times

We consider an open shop scheduling problem with setup and processing times separately such that not only the setup times are dependent on the machines, but also they are dependent on the sequence of jobs that should be processed on a machine. A novel bi-objective mathematical programming is designed in order to minimize the total tardiness and the makespan. Among several mult...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003